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Ch ■ Abstract 

^' 

We extend the multiplicative submodularity of the principal determinants 

O^ ■ of a nonnegative definite hermitian matrix to other spectral functions. We 

show that if / is the primitive of a function that is operator monotone on 

an interval containing the spectrum of a hermitian matrix A, then the function 

p""' 1 1-)- tr/(A[/]) is supermodular, meaning that tr/(A[J]) +tr/(A[ J]) < tr/(A[/U 

rv^ . J]) + tr/(A[/ n J]), where A[I] denotes the I x I principal submatrix of A. 

We discuss extensions to self-adjoint operators on infinite dimensional Hilbert 

space and to M-matrices. We discuss an application to CUR approximation of 



2 ' nonnegative hermitian matrices. 
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_ • ■ 1 Introduction and statement of the main result 

(^ I Let ?TT, be a positive integer and denote [m] := {1, . . . ,m}. A real valued function 

w : 2[™1 -^ M defined on all subsets of [m] is called nondecreasing if w{I) ^ w{J) 
when / C J C [m]. It is submodular if 

w{I) + w{J) ^ w{I U J) + w{I n J) 

for any two subsets /, J of [m]. The function w is called nonincreasing or supermod- 
ular whenever —w is nondecreasing or submodular, respectively. A function that is 
both submodular and supermodular is called modular. 

The importance of submodular functions in combinatorial optimization is well 
known. Several polynomial time algorithms to minimize a submodular function un- 
der a matroid constraint are known, we refer the reader to the survey |lwa08| for 
more information. The maximization of a submodular function under a matroid con- 
straint, and specially, under a cardinality constraint, Ukiw) := max/(-[^] |7[^;jZ/;(I), 
is also of great interest. For some submodular functions w the latter problem is 
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NP-hard. However, a classical result |NWF78] shows that when w is nondecreas- 
ing and submodular, the greedy algorithm allows one to compute an approximation 
v^{w) of Vk{w) which is such that i^^{w) ^ {1 — e~^)i'k{w) . See |CCPV07j for recent 
developments regarding submodular maximization. 

Denote respectively by C™"^™ D H^ D }i„i{£) the space of m x m complex 
valued matrices, the space of m x m hermitian matrices, and the subset of A G H^ 
whose eigenvalues lie in the interval £" C M. For / C [m] denote by A[I] the 
principal submatrix of A, obtained from A by deleting the rows and columns in the 
set [m] \ I. For A,B (^ H^ denote by ^ the Loewner ordering, so that A ^ B if 
A- B e H^([0, oo)). We also write A^B or AyB if A-Be H^([0, oo)) \ {0} 
on: A — B G Hm((0, cxd)) respectively. 

Recall that the principal minors of a nonnegative definite matrix satisfy the 
multiplicative submodularity property: 

det^[/U J]detA[/n J] ^ detA[/]detyl[J], where/, Jc[m], A e Hm([0, oo)). 

(1.1) 
In other words, the function log(-. A) : 2'™' — )■ M given by 

log(/,^) := logdet A[/], lc[m], Ae H™((0,oo)) (1.2) 

is submodular. This inequality has arisen in the work of several authors. It goes 
back to Gantmacher and Krein |GK60] and Kotelyanskii [Kot50| , see the discussion 
by Ky Fan |Fan67[ IFan68] . It can also be found in |KK83[ [JB85] . The classical 
Hadamard-Fischer inequality for the principal minors of nonnegative definite ma- 
trices is obtained when / n J = 0, understanding that det A[0] = 1. We refer the 
reader to [FJOOj for a survey of determinantal inequalities, and to |CT88j for their 
relation with information theory. It is well known that the inequality (jl.ip hold also 
for M-matrices, e.g. jCar67j . 

In this paper, we derive a general submodularity result for spectral functions 
of hermitian matrices, for some p-trace of M-matrices and some extensions to self 
adjoint positive operators on infinite dimensional separable Hilbert spaces. 

We first summarize our results on hermitian matrices. Recall that if A = UDU*, 
where U is unitary and D = diag(Ai, . . . , Am) is the diagonal matrix with the eigen- 
values of A on the diagonal, the matrix f{A) is defined to be U diag(/(Ai), . . . , f{Xm))U* 
Note in particular that tr f{A) = /(Ai) + • • • + f{Kn)- 

Recall that a real function / is operator monotone on the interval £^ C M if for 
all m ^ 1 and for all A,B (^ Hm(f ), 

A^B ^ f{A) < f{B) . 

An operator convex function on the interval £ is defined by requiring that 

tf{A) + {l-t)f{B)hf{tA+{l-t)B) 

for each t G [0, 1]. A function / is operator antitone (resp. operator concave) on an 
interval if — / is operator monotone (resp. operator convex) on the same interval. 
Operator monotone and operator convex functions are characterized by Loewner 
theory [Low34] . In particular, integral representations like the one of (j2.ip below 
are known. We refer the reader to [Bha97] for more background. Matrix inequalities 
involving operator monotone functions can be found in |AZ99l IZha02| . 
The main result of this paper is the following. 



Theorem 1 Let f be a real continuous function defined on an interval £ of M, 
and assume that f is the primitive of a function that is operator monotone on the 
interior of 8. Then, for every m x m hermitian matrix A with spectrum in £, the 
function 2^ ^ R 

I^tTf{A[I]) 

is supermodular. 

It is understood that tr/(74[0]) := for every function /. 

Recall that the primitive of an operator monotone function is operator convex 
on the same interval, but that not all operator convex functions are obtained in this 
way. We shall see that the conclusion of this theorem no longer holds if / is only 
assumed to be operator convex. 

Let us mention some immediate applications. The derivative of the map t^, 
namely, pt^~^, is known to be operator antitone on (0, cxo) for < p ^ 1, and 
operator monotone on [0, oo) for 1 ^ p ^ 2. The derivative of the map tlogt, 
namely, 1 + logt, is also known to be operator monotone on (0, oo). (See [Bha97j .) 
Hence, the next result readily follows from the main theorem. 

Corollary 2 Let A be a m x m nonnegative definite hermitian matrix. Then, 
for all /, J C [m], 

tT AlIjP + tr A[J]P ^ trA[/U J]P + tr^[/n J]P, for O^p^l, (1.3) 
ti A[I]P + tr A[J]f ^ ti A[I U J]P + tr A[I n J]P , for l^p^2, (1.4) 



and 



tr {A[I] log A[I]) + tr {A[J] log A[J]) ^ (1.5) 

tr {A[I U J] log A[I U J]) + tr {A[I n J] log A[I n J]) . 
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We write A[I]'P for (^[7])^. It is understood that A^ := limpx^o^^? foi' every non- 
negative definite matrix A. In particular, A^ is the identity matrix Id if A is positive 
definite, and, if A is nonnegative definite, tr A^ = rank A. 

We now survey briefly the contents of our paper. In §2 we prove Theorem [TJ In 
§3 we discuss M-matrices. We show the inequalities (jl.ip . (|1.3p and (jl.Sp . Moreover 
tr A[I]P is a supermodular function for each p < 0. In §4 we discuss the extensions of 
Theorem[T]to the space S('H) of self adjoint operators on a separable Hilbert space H. 
We consider the lattice of closed subspaces U m Ti, with joint and meet operations 
clo(U + V) and U n V, where clo denotes the closure of a set, together with its 
sublattice of finite dimensional subspaces Uj. Let P(U) € SCH) be the orthogonal 
projection on U € ^. Associate with each JJ £ U the operator ^(U) which is the 
restriction of P(\J)AP(U) to U. Thus, ^(U) is an analog of B[I] for a hermitian 
matrix B. For U G Z^// it is straightforward to show that w(\J) := tr/(^(U)) is 
submodular under the conditions given in §2. For an infinite dimensional subspace 
U, we restrict out discussion to Wp{\J) := ti{A{\J)y and the von Neumann entropy 
w(\J) := — tr(^(U) log ^(U), under the assumption that A is positive and compact. 
Assuming that tr^^ < cx) we show that Wp{\J) is submodular for p £ (0,1) and 



supermodular for p € (1,2). We show that w is submodular it —ti{AlogA) < oo. 
In §5 we discuss the CUR approximation [GT Z97] of nonnegative definite hermitian 
matrix. The main problem here is to find a good approximation to the maximum of 
tr(Iog^[/]) on all subsets / of [m] of cardinality k. We discuss briefly the obvious 
greedy algorithm for this problem, give a simple condition where tr(log^[/]) is 
nondecreasing, and give an estimate for the CUR approximation obtained by the 
greedy algorithm in the general case. In §6 we give examples to show that in general 
the results of §2 are best possible. 

Note added to the arXiv postprint. The authors thank D. Petz for having brought 
to their attention his work with K. Audenaert and F. Hiai (Strongly subaddtive func- 
tions, Acta Math. Hungar. 128(4) :386-394, 2010), after this paper was published on 
line in Linear Algebra Appl. (December 2011, doi:10.1016/j.laa.2011.11.021). Theo- 
rem 1 of this paper is a slightly more general version of their Theorem 4.1 (the latter 
corresponds to the case in which the interval is (0, oo)). Some results of the present 
Section 4 concerning the finite dimensional case (Theorem 5) can also be thought of 
as an extension of theirs. Their work is motivated by quantum information theory. 

2 Proof of Theorem [1] 

Observe first that if all the eigenvalues of A belong to the interval £, so do the 
eigenvalues of the principal submatrix A[I]. Hence, the function / i— > tr/(A[I]) is 
well defined for all / C [m]. 

It suffices to consider the case in which the interval £ is bounded. Moreover, if 
the result is established for every matrix A the spectrum of which is included in the 
interior of £, arguing by density, the result must also hold whenever the spectrum 
of A is included in £. Hence, we may assume that £ is open. Since the property to 
be established is invariant by a translation and a scaling of the interval £, we finally 
assume that £ = (—1, 1). Then, a theorem of Loewner (Corollary V.4.5 in |Bha97] ) 
shows that every operator monotone function g on ( — 1,1) can be written as 



9{t) = 5(0) + g'iO) I ^ Y^Ai'^''^^^ ^^-^^ 



where /x is a probability measure on [—1, 1]. Moreover, ^'(0) ^ 0. 
Assume now that / is a primitive of g, so that 

f{t)=a + tg{0)+g'{0)l ds j -^-±^^d^l{\) , 

for some constant a. Observe that for \t\ < 1, the denominator 1 — As is bounded 
below by 1 — [t|, and so, the above double integral is absolutely convergent. Applying 
Fubini's theorem, we get 



fit) 



where 



a + tgiO) + g'iO) [ (i/i(A) / -^ds 
J-i Jo 1 - As 



a + tg{0)+g'{0)l ^{X,t)dfi{X) 



ip{X,t):=-j-^logil-Xt) . 



Although A appears at the denominator, this expression defines a function of (A, t) 
that extends continuously to [—1, 1] x (—1, 1). (In particular, (p{0,t) = ^t"^-) 
For all A G [—1,1], consider now the functions w and wx from 2'-"^' — t- M, 

Wx{I):=tTip{X,A[I]) , w{I):=tr{ald + g{0)A)[I] , 

so that 

tvf{A[I]) = w{I)+g'{0)l wx{I)df,{X) . (2.2) 

Observe first that the function / i— )• — i^trA[/] is modular, for all A 7^ 0. The 
eigenvalues of the matrix Id — XA belong to the interval [1 — A, 1 + A], which im- 
plies that this matrix is positive definite. Since the sum of supermodular functions 
is supermodular, it follows from the multiplicative submodularity property of the 
determinant, Eqn (II. ip . that 

wx{I) = -\trA[I] - J^log(/,Id - XA) 

is supermodular, as soon as A 7^ 0. Since wx{I) depends continuously of A, the same 
is true when A = 0. Note finally that the map id is modular. Since the supermod- 
ularity property is preserved by taking positive linear combinations and integrals 
with respect to positive measures, the result follows from the representation ()2.2p . □ 



3 Submodularity and super-modularity inequalities for 
M-matrices 

Denote by M™^™ the set of nonnegative matrices. For A, B £ ]^™xm -^q denote 
A^ B if A-B eR'^'"'^. We also write yl > B and A > BifA-B eRf''\{0} and 
A — B has positive entries respectively. For B € M™^™- denote by p{B) the spectral 
radius of B. The Perron- Frobenius theorem yields that p{B) is an eigenvalue of a 
matrix B G M™''"'. Recall that A € R'^x"' is called an M-matrix if all oflF-diagonal 
entries of A are nonpositive and all the principal minors of A are nonnegative. 
Equivalently, A is an M-matrix if ^ = sId - i? for some B G M™^"" and s ^ piB). 
In this section we assume that A is an M-matrix and B € R^j^^" unless stated 
otherwise. Note that A = sId — i? is invertible if and only if s > p{B). Assume that 
A is an invertible M-matrix. Then for any p G M we define 

i = ^^^ 4 = 1 ^^ 

Since s > p{B), a standard argument shows that the above series absolutely con- 
verges. When A is singular, we define, for every p ^ 0, ti A^ := limj_^^+ tr{tld — B)P. 
Indeed, one readily checks that the spectrum of the matrix (tid — B)p as a point- 
wise limit as t — 7- s+, if p ^ 0, so that limit of tr(tld — i?)^ as t — t- s"*" does exist. 
(However, we warn the reader of the abusive character of the notation trA^: the 
matrix A^ := lim(_^5+ (tid — B)P may not exist if is not a semi-simple eigenvalue of 
A.) Since p{B[I]) ^ p{B) for any / C [n], it follows that A[I] is also an M-matrix. 



Furthermore, if A is invertible then A[I] is invertible. We finally agree that if A is 
singular then tr log A = —oo and tr A^ = oo ii p < 0. 
The main result of this section is the following. 

Theorem 3 Let A be an M -matrix. Then, for all /, J C [m], 

tTA[I]P + trA[J]P ^ tTA[IUJ]P + trA[InJ]P, for O^p^l, 

ti A[I]P + tr A[J]P ^ tvA[IUJ]P + trA[Ir\J]P, for p<0 or 1 s^ p ^ 2 , 

and 

tr {A[I] log A[I])+ tr {A[J] log A[J]) s^ 

tr(^[/U J]log^[/U J]) +tr(^[7n J]log^[/n J]) . 

The proof of this theorem relies on the following lemma. 

Lemma 4 For any B eR'^'''^ 

tr B[ir + ti B[jr ^ tv B[I [J Jr + tr B[I n J]"" 

for every integer n ^ 1, and the equality holds for n = 1. 

Proof. Consider the complete directed graph with m nodes. Define a closed 
walk of length n to be a sequence a = (ii, . . . ,in+i) of elements of [m] such that 
in+i = ii- We say that a is included m I, and we write a C /, if ii, . . . , in+i £ I- The 
weight of this walk is |a| := Bi-^i^ . . . Bi^i^^^^ . By a classical result |Sta971 Th. 4.7.1], 
tr B[I]"' is the sum of the weights of all closed walks of length n included in I. By 
a disjunction of cases, we deduce that 

tr5[/UJ]" = ^\a\ + ^\a\ + ^\a\+ ^ \a\ , (3.1) 



aCl 


acl 


aCJ 


aClUJ 


OL^J 


aCJ 


a<^I 


a(tJ 



with the convention that all sums are restricted to the walks a of length n. Adding 
the two first sums in (13. ip yields tri3[I]'^. Moreover, adding tri?[/n J]" to the third 
sum in (jS.ip yields tri?[J]". We conclude that 

tr B[I yj Jf +tr B[Ir\Jf -tr B[If -trBlJY" = ^ |a| ^ , 

aC/UJ 

since the entries of the matrix B are nonnegative. Moreover, when n = 1, the latter 
sum trivially vanishes. □ 

Proof of Theorem |4l Suppose first that s > p{B). We first consider the 
function tr74[I]P. We have 



where 



tr A[I]P = sPtrId[/] - sP+^ptrB[I]+w'iI) 
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Observe that for p G (—00, 2], all the coefficients of the latter sum have the same 
sign (or vanish), and this sign is positive if p € (— oo,0) U (1,2) and negative if 
p G (0, 1). Hence, it follows from Lemma H] that the function w' is supermodular in 
the former case and submodular in the latter case. Since / 1— ?> tr74[/]^ is the sum 
of the modular function / 1— ?> s^trld[/] — s^~^^ptr B[I] and of w', the announced 
submodularity and supermodularities of the map / 1-^ tr A[I]p hold for this matrix 
A. 

Similarly, since — (1 — x) log(l — x) — x has negative Taylor coefficients, it follows 
as above that — tr(j4[/] log j4[/]) is submodular. 

Assume now that s = p{B), i.e., that ^ is a singular M-matrix. Let At = tld — B 
for t > p{B). Letting t \ p{B) we deduce all the above inequalities for A except 
for p = 0. Letting p \ we deduce that tr ^[1]" is a submodular function. (Note 
that we cannot use p < since tx A^ in this case in 00.) □ 

Note that since log(l — x) has negative Taylor coefficients, it follows as above 
that the function / 1-^ tr(log A[/]) is submodular, i.e., we have obtained yet another 
proof of the inequality (II. ip for M-matrices |Car67] . 

4 Submodular functions on the lattice of closed sub- 
spaces of a Hilbert space 

Let T-i he a. separable Hilbert space over C with the inner product (x, y) and the 
norm ||x|| = y'^x^x). Denote by '&{'H) D S+('H) the space of bounded self adjoint 
linear operators A ■.'H ^ % and the cone of bounded positive self adjoint operators. 
So (Ax, y) = (x. Ay) for A e ^{Wj and (Ax, x) ^ for A G S+{U) for all x, y G 7^. 
For A G S('H) the operator norm \\A\\ is given by sup||x||<;i |(Ax,x)|. More precisely, 
the spectrum of A G S{'H) lies in the interval [0(^4), h{A)] where 

a{A)= inf (Ax,x), h{A) = sup (Ax, x), ||A|| = max(|a(A)|, |6(A)|). (4.1) 
l|x|Kl ||x!|s:i 

Assume that A G SiTi). We first discuss the analog of A[I] when I is a finite 
subset of N of cardinality /. Let U be an /-dimensional subspace of Ti. Denote 
by P(U) G S+{n) the orthogonal projection on U. So 7^ = U IJ-^ = U 
(Id - P(U))n. Consider the operator P(U)AP(U). Clearly P(U)AP(U)U^ = 
{0}, P(U)AP(U)U C U. Denote by A(U) the restriction of P(U)AP(U) to U. As 
||Px|| ^ ||x|| for X G ^ the characterization (14. Ih yields 

a{A) ^ a(A(U)) ^ b{A{V)) ^ b{A). (4.2) 

In particular, if the spectrum of A lies in a given interval £, so does the spectrum 
of A(U). 

Let V be another finite dimensional vector space of ^. Then A(U), A(V), A(U-|- 
V), A(U n V) are the analogs of A[/], A[J], A[/ U J],A[I D J] respectively. 

Indeed, let W = U + V and choose an orthonormal basis W := {xi, . . . , x^} in 
W, such that X := {xi, . . . , Xg} is a basis in U, Z := {xp, Xp+i, . . . , x^} is a basis 
in U n V and Y := {xp,Xp_|_i, . . . ,Xm} is a basis in V. It is easy to see that the 



operators A(W) , A(U) , A(V) , A(U n V) are represented in the above bases by the 
following matrices 

AiW) = [(Ax„x,)]-,=i, AiX) = [(^x^x,)]" 






AiY) = [{A^,,^j)]Zi=„ A{Z) = [(^x„x-\i'' 



Denote hy U D Uj the set of all closed subspaces of H and the subset of all finite 
dimensional subspaces, respectively. A function w '.Uf ^-M. is said to be submodular 
if w(U) + tt;(V) ^ w(U+V)+i(;(UnV) for all U, V € Uj. It is super-modular ii —w is 
submodular, and modw/ar if it is both submodular and supermodular. The following 
result is an immediate consequence of Theorem [1] and of the previous observations. 

Theorem 5 Let f be a real continuous function defined on an interval £ of M, 
and assume that f is the primitive of a function that is operator monotone on the 
interior of £. Let H be an infinite dimensional separable Hilhert space space. Then, 
for every A G S('H) with spectrum in £, the function w : Uf ^ M. given by 

U^tr/(A(U)) 

is supermodular. □ 

A function w : Uf ^ M. is extendable to U if the following condition holds for 
each closed infinite dimensional subspace U C "W. Let Uj,z ^ 1 be an increasing 
sequence of finite dimensional subspaces of U such that U^^U, is dense in U. Then 
the sequence w{\Ji),i ^ 1 converges to a unique value independent of the sequence 
Ui,i ^ 1. We denote this limit by w(U). 

In the rest of this section, we discuss the extension of certain submodular and 
supermodular functions of a subspace U, arising from Theorem [5l 

Denote by CS+(T-L) C S+('H) the closed ideal of positive compact operators. For 
A G CS+(?^), the Hilbert space Ti has a countable orthonormal basis ui, U2, . . . such 
that 

Am = Xi{A)u„i ^ 1, Ai(^) ^ X2{A) ^ . . . ^ 0, lim X,{A) = 0. 

We now state some known results about characterizations of eigenvalues of ^ G 
S+CH) that we will use here. Consult with |Fri73j . Let Xm be the set of all or- 
thonormal system of m vectors X := {xi, . . . ,Xm} in H. The convoy principle (see 
Lemma 1 in [ Fri73j ) can be stated in the form of Corollary 4.4.3 in [Fbk] . 

sup XiiA{X)) = Xi{A) for i = 1, . . . , m. (4.3) 

The supremum is achieved for X = {ui, . . . ,Um}. This in particular implies the 
Ky-Fan inequalities 



tr(^(X)) = ^(^Xi,Xi) ^ j;^ Ai(A). 



i=l 



Let U = span(xi, . . . ,Xm). Then the eigenvalues of A{X) are the eigenvalues of 
A(JJ). The following approximation result is well known and can be straightfor- 
wardly deduced from the arguments in [Fri73] . 



Lemma 6 Let T-L be an infinite dimensional separable Hilbert space. Let A G 
CS+('H) andJJ be an infinite dimensional closed subspace ofJi. Assume thatJJiji ^ 
1 is an increasing sequence of finite dimensional subspaces in U, such that Uj^iUj 
is dense in U. Then for each j ^ 1, the sequence Xj{A{\Ji)),i ^ l,dimUi ^ j is 
nondecreasing and converges to Xj{A(JJ)). 

Let p > 0. Denote by Wp : Uj ^ R the function Wp{lJ) := tj:A{\jy for 
A € S+('H). The interlacing properties of hermitian matrices yield that Wp is 
a nondecreasing function, i.e. Wp{\J) ^ Wp^V) for U C V. For p > de- 
note by CS_|__p(?^) the subset of all positive compact operators A on V. such that 
tiAP := ^~'i \j{A)P < oo. Clearly CS+,p{n) g CS+,q(1H) for < p < q. 

Lemma 7 Let % be an infinite dimensional separable Hilbert space. Assume 
that p > and A € CS+^p(^). Then the function U i->- tr y4(U)^ is extendable from 
Uf to U. Moreover Wp is a nondecreasing function. 

Proof. Let Uj,i ^ 1 be a sequence of finite dimensional subspaces of U such 
that U^j^Uj is dense in U. Assume that dimUj = nj. Then the positivity of 
A(\Ji) and (gSD imply that ^ Xj{A(\Ji)) ^ Xj{A(U)) ^ Xj{A), for j = 1, . . . ,nj. 
Consider now, for each i ^ 1, the function ipi : {1,2,...} — )• [0, cx)) such that 
fiU) = '^ji^i^i))^ for 1 ^ i ^ "-i and v'i(j) = for j > rij. Consider also the func- 
tion if : {1, 2, ...}—)• [0, oo) such that (/^(j) := Xj{A(JJ))p. Then, by Lemma[6l the 
sequence of nonnegative functions ipi,i ^ 1 converges monotonically to the function 
(^ as z — )■ oo, and then, the monotone convergence theorem implies that ti A({Ji)P = 
Ylj^i^iU) converges, as i ^ oo, to X]i>i¥'(i) = tr A{\J)p ^ tr^^ < oo. Finally, 
the convoy principle implies that w :U —^M. is a nondecreasing function. □ 



Lemma 8 Let 7i be an infinite dimensional separable Hilbert space. Assume 
that A € CS+('H), and that — tr(j41og74) < oo. Then the function w : U i— ?> 
— tr(j4(U) log A(U)) is extendable from 1/1 f to hi. 

Proof. Let Uj,z ^ 1 be as in the proof of Lemma [71 still with rij = dimUj. 
Observe first that the function —x log x is increasing and nonnegative on [0, e^^]. As 
limj_>>oo ^j(^) = 0, we can find an index J such that Xj{A) ^ e~^ for all j ^ j. Let us 
fix such a j. Then, by the convoy principle, we have ^ Xj{A{\Ji)) ^ Xj{A) ^ e~^, 
for all i, and since the sequence Xj{A{\Ji)),i ^ 1 is nondecreasing, it follows that 
the sequence — Aj(A(Ui)) log Aj(^(Uj)),i ^ 1 is nondecreasing and nonnegative. 
Applying the monotone convergence theorem as in the proof of Lemma [71 we deduce 
that 

^ .hm Y, -HM^i)) log A,(A(UO) = Y. -H^^^)) log A,(^(U)) 

3^3 3'^ 3 

^ Y,_-HA)\ogX,{A) , (4.4) 

again with the convention that Aj(Ui) = for j > rii. Since the sum in (j4.4p differs 
from the sum ^j^i —Xj{A) log Xj{A) = — tr(^log(^)) < oo only by a finite number 



of terms, we conclude that the sum in (j4.4p cannot be equal to oo. Since for all 
^ j < J, we also have Xj{A{\Ji)) -^ Xj{A(\J)) as z ^- oo, we finally get 

-oo < lim -tr(A(Ui)logA(Ui)) = - tr(A(U) log^(U)) < oo . 

D 

Theorem 9 Let Ti be an infinite dimensional separable Hilbert space. As- 
sume that p > and A E CS+^p('H). Then the function Wp : U ^ M. given by 
Wp(U) := tr(A(U))P is nondecreasing, it is submodular for p € (0, 1), supermodular 
for p S (1,2] and modular for p = 1. Furthermore, if A ^ CS+('H) is such that 

— tr(^logA) < oo, and in particular, if A £ CS-^- ^p{'H) for some p & (O?!); then the 
function w(U) := — tr(A(U) log j4(U)), U GU is subTuodular. 

Proof. The results for the function Wp{-) follows from Corollary [2] and Lemma 
[71 The submodularity property for the function ■w{-) follows from the same corol- 
lary and Lemma [H Finally, for p G (0,1), (— xlogx)/^^ — )• as x \ 0, and 
since Xj{A) \ as j /^ oo, the convergence of the series tr A^ = J2j ^j{^)^ ™" 
plies the convergence of the series — tr(^log^) = "^^ —Xj{A) log Xj (A). Hence, 

— tr(741og(j4)) < oo holds as soon as A G CS+^p(?{) for some p € (0, 1). □ 



5 CUR approximation of nonnegative definite matrices 

Let A E ([^rnxn |-jg q£ j.g^j^]j J. — j-ank A. Assume that /c G [r — 1]. The best rank k 
approximation of A, denoted as Ak, is given by the singular value decomposition, 
abbreviated here as SVD, |GVL96] . When m and n are very big, e.g. n,m'^ 10^, 
finding A^ is not computationally possible, since the complexity of computing A^ 
is 0{kmn). A good alternative for A^ is the CUR approximation, where C G 
W^^^,R € C'^^" are submatrices of A of the form A[[?tt,], J], j4[/, [n]], where J C 
[n],I C [m] are subsets of columns and rows of A of cardinality k. Here U G 
(^kxk jg g^ suitably chosen matrix. The best choice U* for the Frobenius norm, i.e. 
argmin{/tr((A - CUR){A - CUR)*), is given by C'^AR'^, where B^ denotes the 
Moore-Penrose inverse [FMMNJ . Again, for m,n ^ 1 it is unfeasible to compute 
U*. 

Assume that ^[/, J] G C^^^, the submatrix of A based on the rows and columns 
/, J respectively, is invertible. Then CA[I, J]~^R has the same I rows and J columns 
as A. Hence the best choice of /*, J* seems to be given by [GTZ97J 

fik{A) :=max{|det^[/, J]|,/ C [m],J C [n],\I\ = \J\ = k} = [ det A[/*, J'']]. (5.1) 

More precisely, for A = [a^J^"^^ G ^mxn ^q^ ||A||oo,e := maxjg[„]jg[„j \aij\ be the 
entrywise max-norm of A. Denote by cri{A) ^ ... ^ o'r{A) > the r-positive 
singular values of A. If det A[I, J] ^ one has the inequality |GTZ97] 

\\A - CA[I, J]-'R\Ue ^ ^|^j^(^ + l)^m(^)- (5.2) 

(See |Fbkl §4.13] for a simple proof of the above inequality.) 

In this section we discuss the maximal problem (j5.ip and the related problem of 
CUR approximation where A G Hm,+. 
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Lemma 10 For A G Hm,+ and k S [m], the maximum in (j5.ip is achieved for 
I" = J*, i.e. 

fj.k{A) = max detyl[/]. (5.3) 

/C[m],|/|=A: 

Proof. Let A = [ojj]™ •^^. We first consider the easy case k = 1. Recall that 
an ^ and each 2 principal minor of A is nonnegative. Hence auaij — |ajjp ^ 0. 
Thus max(aii,Ojj) ^ \aij\ = \aji\. Hence ()5.3p holds for k = 1. Assume that k > 1. 

Let Ck{A) = a'' A £ c(™)''(™) be the A;-th compound of A. (The entries of Ck{A) 
are all detA[/, J], |/| = \J\ = k arranged in a lexicographical order.) Recall that 
Ck{A) G H('m\ , . Hence the previous argument applied now to any 2x2 principal 
minor of the matrix Ck{A) yields (|5.3p . □ 

We now introduce a simple greedy algorithm for finding an approximation of 
/ifc(j4) for A € Hm,+ and the corresponding row index /. 

Algorithm 11 

Input.- A = [ttij] G Hm,+, k e [m], I = (I), K = [m],t = 1 
Output.- A subset I of [m] of cardinality k and dety4[/](= t) 
Find i (z K an = max j^k cljj. 

If an = set I = [/c] , t = exit 

/ = / U {i}, K = K\ {i}, t = tan 

If |/| = k exit 

For p,qeK apg = apg - ^^ 

To analyse this algorithm, assume for the simplicity of argument that det ^4 > 0. 
Then, we may introduce the submodular function w{I) := tr(log^[/]), and consider, 
as in the introduction, the maximum Ukiw) of the function w over all subsets J C [m] 
of cardinality A;, so that Vkiw) = log^fc(^)- 

Algorithm II 1 1 should be compared with the standard greedy algorithm |NWF78] 
to approximate the maximum Uk{w), which consists in constructing a sequence of 
sets % = I^ d I^ CI ■ ■ ■ <Z I^ CI [m] such that at each step r, /'' = F~^yj{ir} where the 
index i^ is chosen to so that w{P~^\J{ir}) = maxjg[^]\^7r-i w{T''^'^\J{i}). We denote 
by V {w) := w{I ) the value of the solution I returned by this greedy algorithm. 
Algorithm [11] is nothing but an implementation of this greedy algorithm, in which 
at each step, the augmenting index ir is obtained as a maximal pivot (third line of 
the algorithm) and the value w{V) is obtained incrementally from w{r'~^) by one 
Gaussian elimination step (fifth and seventh lines of the algorithm). In particular, 
denoting by fJ-'^{A) the value returned by Algorithm [TTl we get f^(w) = log^'^{A). 

Assume now that w is nondecreasing. Then, it is straightforward to show that 
if m ^ 2, then det A[J] ^ 1 for each J C [m], so that w is nonnegative. Then, the 
result of |NWF78] can be applied to w, showing that the greedy algorithm has an 
approximation factor 1 — e~^, i.e., i^^{w) ^ (1 — e~^)h'k{w). It follows that the set 
/ returned by Algorithm [11] satisfies 

det^[/] = f,^{A) ^ fik{A)'-'~' . (5.4) 

Therefore, we arrive at the following estimate of the CUR approximation error of 
the greedy algorithm. 
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Proposition 12 Let m ^ 2, and let A € Tim,+ be such that w(/) := trlogj4[/] 
is nondecreasing. Then, the subset I returned by Algorithm\ll\ satisfies: 

\\A - A[[m],I]A[I]-^A[[mllY\\^,e ^ {k + l)fik{Ay~" ak+i{A). 
Proof. Combine Inequality (j5.4p with Inequality (|5.2p . □ 

We now give a simple condition for tt; to be a nondecreasing function. 

Lemma 13 Let A £ 11,^,+ . Assume that X^^A) ^ 1. Thenw is a nondecreasing 
function. 

Proof. Let / be a nontrivial subset of m and let i G [m] \ L. As the eigenvalues 
of B := A[L] interlace the eigenvalues of C := A[L U {i}] we deduce that 

date = Yl \j{C) ^ (detS)A|j|+i(C) ^ {det B)XrniA) ^ detB. 

n 

We conclude this section with an upper estimate of the CUR approximation 
error of yl € 11^,+ using Algorithm [TTJ This estimate relies on the Hadamard 
determinant inequality and remains valid even when w is not nondecreasing. 

Proposition 14 Let A G iim,+ o-nd fc € [m — 1]. Assume that k ^ rank A. Let 
I be any subset of [m] of cardinality k returned by Algorithm\ll\ Let ai, . . . ,ak be 
the k maximal entries out of the m diagonal entries of A. Then 

\\A-A[[m],L]A[I]-U[[m],I]*\U,^^^^{k + l)ak+i{A). 

Proof. Let /* be such that HkiA) = det^[/*]. The Hadamard determinant in- 
equality yields that det^[/*] ^ Hje/* ^jj- Clearly Hie/* "ii ^ oi • • • Ofc- Use (|5.2p 
to deduce the lemma. □ 



6 Examples and further comments 

We now give counter-examples showing that several natural generalizations of the 
present results do not hold. In particular, we shall see that the assumptions in 
Corollary [2] under which the function / i— > Tp(L,A) := tr A[L]P is sub or super- 
modular, for a, m X m nonnegative definite matrix A, are tight. 

To this end, we shall consider a hermitian mxm matrix B with a block partition 

of the form 

Li 

Bu 
B = L2 \ BI2 

-^13 
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so that [m] is the disjoint union of the sets Li,L2,I/3. Given /, J C [m], it is 
convenient to set 

6{f,I,J,B):=tTf{B[I]) + tTf{B[J])-trf{B[IUJ])-tTf{B[InJ]) . 

Hence, 5{f, I, J,B) ^0 for all /, J whenever / i— )• tr f{B[I]) is submodular. 
We now choose 

I := L1UL2, J:=L2UL3 . 

Using the interpretation of the matrix product in terms of concatenation of paths, 
we readily get 

5{t^,I,J,B) = -2tr(Bi3B*3) ^ . (6.1) 

Moreover, the inequality is strict as soon as Bi^ ^ 0. Since 5(1^,1, J, B) depends 
continuously of p, we get the following proposition, in which one can take A = B. 

Proposition 15 For all m ^ 3, and for p close enough to 2, the strict super- 
modularity inequality for Tp{- , A) may hold. □ 



In order to give examples in which the strict opposite inequality may hold, we 
observe that 

Bi3 = ^ 6{t\l,J,B) = and 5{t'^,I,J,B) = -Atr Bl2Bi2B23B^^ . (6.2) 

Note that 5{t^, I, /, B) < 0, unless the nonnegative definite matrix i?i2 -823-623-6^^2 is 
zero. We assume in the sequel that this is not the case. 
Let us consider 

A = ld + sB . 

Using (j6.ip and (|6.2p . together with the trivial fact that d{t, /, J, B) = 0, and tr(/ + 
sB)P = tr/ + pstr-B + • • • + p(p-i)(p-2)(p-3) g4^^^4 _^ Q^^^b^^ ^^ identify the first 
terms of the Taylor expansion of 5(t^, I, J, A) in s, 

6{tP, /, J, A) = _ P(p-l)(P^y2)(p-3) ^4 1^ BuB*^^B23B^^ + 0{s^) (6.3) 

For s sufficiently small, the sign of the latter expression is positive for p € (2,3), 
and negative for p > 3 or p < 0. 

Proposition 16 For all m ^ 3, and for all p G (2, 3) (resp p < or p > 3) the 
strict submodularity (resp. supermodularity) inequality for Tp{- , A) may hold. □ 



Remark 17 The absence of systematic inequalities, except for p £ [0, 1] or 
p € [1,2] is related to the absence of majorization properties. Indeed, consider 
again the matrix B above, assume that it is of size 3, so that the blocks B12 and 
B23 are scalars, and assume that -B13 = 0, so that B is a Jacobi matrix. We choose 
again / = {1, 2}, J = {2, 3}. The spectra of the matrices B, B[I], B[J], B[I n J] are 



UBf, + Bl.„0,-JBf, + Bl,), {\Bi2\,-\Bi2\), {\B23\,-\B23\), (O) 
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Consider A := Id + B, so that A > for 1 > \ABi2+^23- "^^^ joint sequences of 
the eigenvalues of A with A[I n J] and of A[I] with A[J] are 



"12 



+ Bi^+l,l,l, 



Bh + Bls + l), (|i?l2| + l,|i?23l + l,-|i?23l + l,- 



\B 
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1 + 1). 



Note that for B12B23 i^ none of the joint eigenvalue sequences majorizes the other 
one. This example should be compared with a theorem of Bapat and Sunder |BS85| . 
concerning the special case in which / n J = 0: then, the sequence obtained by 
concatenating the eigenvalues of A\I\ and A\J\ is majorized by the sequence of 
eigenvalues of A\l U J], and so, the inequality (II. 4p for the map / 1— )• tr ^[7]^ holds 
for for ah p € (-00, 0) U [1, 00) if / n J = 0. 

Example 18 There are negative values of p for which Tp(-, A) is not supermod- 
ular. Consider for instance the positive definite matrix 



I = {1, 2}, J = {1, 3}, (5(t"S /, J, A) = 16/35 > 



The following counter example shows that Theorem [T] does not carry over to 
operator convex maps. 



5 


-12 


9 


2 


33 


-24 


9 


-24 


19 



Example 19 The map f{t) = t^{\+t) ^ is known to be operator convex |Bha971 
Problem V.5.5]. Consider the positive definite matrix 



A = 

Then, 5{f,I,J,A) 
supermodular. 



-2 
6 
4 



A = l, 



/ = {1,2},J = {1,3} . 

44/1085 > 0, showing that the map K ^ ti f{A[K]) is not 



We now summarize in Table[T]the submodularity and supermodularity properties 
we know, omitting the proofs of the two easy cases. 



p = —1, m ^ 3 

p < 0, m ^ 3 

p < 0, m = 2 

O^p^ 1 

1 ^p^ 2 

p > 2, m = 2 

p = 2, m ^ 3 

pG (2,3), m^3 

p > 3, m ^ 3 



may not be supermodular 

may not be submodular 

supermodular 

submodular 

supermodular 

supermodular 

may not be submodular 

may not be supermodular 

may not be submodular 



Ex.dl] 
Prop. [TBI 
Omitted 
Coro. [2] 
Coro. [2] 
Omitted 

Prop, [m 

Prop. US 
Prop. [TBI 



Table 1: Summary of submodularity and supermodularity properties of the spectral 
function Tp{I,A) = tr A[/]^ for am x m nonnegative definite Hermitean matrix A. 

Recall that for A,B^ Hm, and A ^ B we have that ti A ^ tri?. Observe also 
that Hm(£') is a convex subset of Hm,. Since the primitive of an operator monotone 
function is operator convex we deduce from Theorem [T) 
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Corollary 20 Let f satisfies the assumptions of Theorem [7J Then for each 
I C [m\ the function A i— > tr/(yl[/]) is a convex function on Hm(<?)- n 



Remark 21 A different submodularity inequality involving spectral functions 
appeared in |BGS08j . with an application to an experiment design problem. It is 
shown there that for all nonnegative definite hermitian matrices A, B, C, and for all 
^ p < 1, tr(^ + B + Cy + tvCP f^ tT{A + C)P + tv{B + Cf . 
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